Selecting a model of nucleotide substitution
نویسنده
چکیده
Models of nucleotide substitution are commonly used in the analysis of DNA sequences. In this unit a protocol is described for the use of the program MODELTEST (coupled with PAUP*) to find the best-fit model of substitution for the sequence alignment at hand. An example data file is analyzed and the interpretation of the results is discussed. Some background theory on model selection and a discussion of the relevance of models is included at the end of the unit. Choosing the best-fit model of nucleotide substitution Models of nucleotide substitution are commonly used in the analysis of DNA sequences, especially in the estimation of evolutionary parameters and in the construction of phylogenetic trees. A common problem that a researcher has to face is the objective selection of such a sample. The program MODELTEST (Posada and Crandall, 1998) offers two sound statistical procedures to carry out this choice. This protocol describes the joint use of the program MODELTEST and PAUP* (Swofford, 2000) to select the best-fit model of nucleotide substitution for a given set of aligned DNA sequences. The three basic steps within this protocol consist of the estimation of a phylogenetic tree, the estimation of likelihood scores for the candidate models, and the selection of a particular model given these likelihood scores. The first two procedures are carried in the program PAUP* with a batch file included in the MODELTEST package. The discussion in this unit summarizes the theory behind the procedure and provides several caveats and guides for interpretation of the results. Particular comments about the implementation different platforms are also given. Users are encouraged to download the most current versions from the MODELTEST and PAUP* Web site USING MODELTEST AND PAUP* TO SELECT A MODEL OF NUCLEOTIDE SUBSTITUTION
منابع مشابه
Selecting the best-fit model of nucleotide substitution.
Despite the relevant role of models of nucleotide substitution in phylogenetics, choosing among different models remains a problem. Several statistical methods for selecting the model that best fits the data at hand have been proposed, but their absolute and relative performance has not yet been characterized. In this study, we compare under various conditions the performance of different hiera...
متن کاملSelecting models of evolution THEORY
Phylogenetic reconstruction is regarded as a problem of statistical inference. Because statistical inferences cannot be drawn in the absence of a probability model, the use of a model of nucleotide or amino-acid substitution – an evolutionary model – becomes necessary when using DNA or amino-acid sequences to estimate phylogenetic relationships among organisms. Evolutionary models are sets of a...
متن کاملStatistical comparison of nucleotide, amino acid, and codon substitution models for evolutionary analysis of protein-coding sequences.
Statistical models for the evolution of molecular sequences play an important role in the study of evolutionary processes. For the evolutionary analysis of protein-coding sequences, 3 types of evolutionary models are available: 1) nucleotide, 2) amino acid, and 3) codon substitution models. Selecting appropriate models can greatly improve the estimation of phylogenies and divergence times and t...
متن کاملWhen is it safe to use an oversimplified substitution model in tree-making?
The choice of an "optimal" mathematical model for computing evolutionary distances from real sequences is not currently supported by easy-to-use software applicable to large data sets, and an investigator frequently selects one of the simplest models available. Here we study properties of the observed proportion of differences (p-distance) between sequences as an estimator of evolutionary dista...
متن کاملSingle Nucleotide Polymorphism Analysis of Protamine Genes in Infertile Men
Background Single nucleotide polymorphism (SNPs) are considered as one of the underlying causes of male infertility. Proper sperm chromatin packaging which involves replacement of histones with protamines has profound effect on male fertility. Over 20 SNPs have been reported for the protamine 1 and 2. MaterialsAndMethods The aim of this study was to evaluate the frequency of two previously repo...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
دوره شماره
صفحات -
تاریخ انتشار 2003